Overview

Dataset statistics

Number of variables: 11
Number of observations: 672
Missing cells: 215
Missing cells (%): 2.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 57.9 KiB
Average record size in memory: 88.2 B

Variable types

NUM: 9
CAT: 2

Reproduction

Analysis started: 2020-06-24 06:14:12.130909
Analysis finished: 2020-06-24 06:14:32.089941
Duration: 19.96 seconds
Version: pandas-profiling v2.8.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]

Warnings

Current Account Balance has 31 (4.6%) missing values
Public debt has 21 (3.1%) missing values
sovereign yields has 30 (4.5%) missing values
GDP Growth has 23 (3.4%) missing values
VIX Index has 12 (1.8%) missing values
Public Deficit has 24 (3.6%) missing values
Unemployment Rate has 14 (2.1%) missing values
Credit To Private Sector has 60 (8.9%) missing values
quarter is uniformly distributed
country is uniformly distributed
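
As a consistency check, the per-variable missing counts in the warnings above should add up to the 215 missing cells reported under Dataset statistics. The following minimal sketch redoes that arithmetic in plain Python. The dictionary and variable names are illustrative, and the numbers are copied from this report rather than recomputed from the underlying data.

```python
# Missing-value counts copied from the Warnings list of this report.
missing_per_variable = {
    "Current Account Balance": 31,
    "Public debt": 21,
    "sovereign yields": 30,
    "GDP Growth": 23,
    "VIX Index": 12,
    "Public Deficit": 24,
    "Unemployment Rate": 14,
    "Credit To Private Sector": 60,
}

n_observations = 672  # rows, from Dataset statistics
n_variables = 11      # columns, from Dataset statistics

# Total missing cells and their share of all cells in the table.
total_missing = sum(missing_per_variable.values())
total_cells = n_observations * n_variables
missing_pct = 100 * total_missing / total_cells

# Per-variable share of missing rows, rounded as in the report.
per_variable_pct = {
    name: round(100 * count / n_observations, 1)
    for name, count in missing_per_variable.items()
}

print(total_missing, round(missing_pct, 1))
```

The totals agree with the overview: the eight counts sum to 215, which is about 2.9% of the 672 × 11 cells.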

Variables

year
Real number (ℝ≥0)

Distinct count: 14
Unique (%): 2.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2006.5
Minimum: 2000
Maximum: 2013
Zeros: 0
Zeros (%): 0.0%
Memory size: 5.2 KiB

Quantile statistics

Minimum: 2000
5-th percentile: 2000
Q1: 2003
Median: 2006.5
Q3: 2010
95-th percentile: 2013
Maximum: 2013
Range: 13
Interquartile range (IQR): 7

Descriptive statistics

Standard deviation: 4.034131578
Coefficient of variation (CV): 0.002010531561
Kurtosis: -1.212394406
Mean: 2006.5
Median Absolute Deviation (MAD): 3.5
Skewness: 0
Sum: 1348368
Variance: 16.27421759
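Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
2013 | 48 | 7.1%
2012 | 48 | 7.1%
2011 | 48 | 7.1%
2010 | 48 | 7.1%
2009 | 48 | 7.1%
2008 | 48 | 7.1%
2007 | 48 | 7.1%
2006 | 48 | 7.1%
2005 | 48 | 7.1%
2004 | 48 | 7.1%
Other values (4) | 192 | 28.6%

Minimum 5 values
Value | Count | Frequency (%)
2000 | 48 | 7.1%
2001 | 48 | 7.1%
2002 | 48 | 7.1%
2003 | 48 | 7.1%
2004 | 48 | 7.1%

Maximum 5 values
Value | Count | Frequency (%)
2013 | 48 | 7.1%
2012 | 48 | 7.1%
2011 | 48 | 7.1%
2010 | 48 | 7.1%
2009 | 48 | 7.1%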

quarter
Categorical

UNIFORM

Distinct count: 4
Unique (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 5.2 KiB

Common values
Value | Count | Frequency (%)
4 | 168 | 25.0%
3 | 168 | 25.0%
2 | 168 | 25.0%
1 | 168 | 25.0%

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

country
Categorical

UNIFORM

Distinct count: 12
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 5.2 KiB

Common values
Value | Count | Frequency (%)
Spain | 56 | 8.3%
Ireland | 56 | 8.3%
France | 56 | 8.3%
Italy | 56 | 8.3%
Luxembourg | 56 | 8.3%
Austria | 56 | 8.3%
Belgium | 56 | 8.3%
Portugal | 56 | 8.3%
Netherlands | 56 | 8.3%
Finland | 56 | 8.3%
Other values (2) | 112 | 16.7%

Length

Max length: 11
Median length: 7
Mean length: 7.166666667
Min length: 5

Current Account Balance
Real number (ℝ)

MISSING

Distinct count: 640
Unique (%): 99.8%
Missing: 31
Missing (%): 4.6%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.00976726006240254
Minimum: -16.39411
Maximum: 17.80625
Zeros: 0
Zeros (%): 0.0%
Memory size: 5.2 KiB

Quantile statistics

Minimum: -16.39411
5-th percentile: -10.80689
Q1: -3.43053
Median: -0.05751
Q3: 4.138015
95-th percentile: 9.451803
Maximum: 17.80625
Range: 34.20036
Interquartile range (IQR): 7.568545

Descriptive statistics

Standard deviation: 6.136273915
Coefficient of variation (CV): 628.2492609
Kurtosis: 0.01997580974
Mean: 0.009767260062
Median Absolute Deviation (MAD): 3.827609
Skewness: -0.05346822329
Sum: 6.2608137
Variance: 37.65385756
Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
2.5 | 2 | 0.3%
-1.234332 | 1 | 0.1%
6.844073 | 1 | 0.1%
-10.2 | 1 | 0.1%
1.89193 | 1 | 0.1%
1.64203 | 1 | 0.1%
6.460021 | 1 | 0.1%
4.319137 | 1 | 0.1%
3.3 | 1 | 0.1%
16.78679 | 1 | 0.1%
Other values (630) | 630 | 93.8%
(Missing) | 31 | 4.6%

Minimum 5 values
Value | Count | Frequency (%)
-16.39411 | 1 | 0.1%
-16.05002 | 1 | 0.1%
-15.4 | 1 | 0.1%
-15.24969 | 1 | 0.1%
-14.54305 | 1 | 0.1%

Maximum 5 values
Value | Count | Frequency (%)
17.80625 | 1 | 0.1%
17.40836 | 1 | 0.1%
16.78679 | 1 | 0.1%
16.6659 | 1 | 0.1%
16.32911 | 1 | 0.1%

Public debt
Real number (ℝ≥0)

MISSING

Distinct count: 482
Unique (%): 74.0%
Missing: 21
Missing (%): 3.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.7022105990783409
Minimum: 0.055
Maximum: 1.7503
Zeros: 0
Zeros (%): 0.0%
Memory size: 5.2 KiB

Quantile statistics

Minimum: 0.055
5-th percentile: 0.0715
Q1: 0.491
Median: 0.663
Q3: 0.9855
95-th percentile: 1.213
Maximum: 1.7503
Range: 1.6953
Interquartile range (IQR): 0.4945

Descriptive statistics

Standard deviation: 0.3274936551
Coefficient of variation (CV): 0.4663752662
Kurtosis: -0.02399620642
Mean: 0.7022105991
Median Absolute Deviation (MAD): 0.22
Skewness: 0.2456493039
Sum: 457.1391
Variance: 0.1072520941
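Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
1.034 | 7 | 1.0%
0.663 | 5 | 0.7%
0.061 | 5 | 0.7%
0.06 | 5 | 0.7%
0.66 | 4 | 0.6%
1.077 | 4 | 0.6%
0.668 | 4 | 0.6%
0.575 | 4 | 0.6%
0.662 | 4 | 0.6%
0.063 | 4 | 0.6%
Other values (472) | 605 | 90.0%
(Missing) | 21 | 3.1%

Minimum 5 values
Value | Count | Frequency (%)
0.055 | 2 | 0.3%
0.056 | 1 | 0.1%
0.057 | 2 | 0.3%
0.059 | 2 | 0.3%
0.06 | 5 | 0.7%

Maximum 5 values
Value | Count | Frequency (%)
1.7503 | 1 | 0.1%
1.7265 | 1 | 0.1%
1.7203 | 1 | 0.1%
1.6741 | 1 | 0.1%
1.6402 | 1 | 0.1%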

sovereign yields
Real number (ℝ≥0)

MISSING

Distinct count: 575
Unique (%): 89.6%
Missing: 30
Missing (%): 4.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.505939252336448
Minimum: 1.34
Maximum: 25.4
Zeros: 0
Zeros (%): 0.0%
Memory size: 5.2 KiB

Quantile statistics

Minimum: 1.34
5-th percentile: 2.2615
Q1: 3.63825
Median: 4.2115
Q3: 4.9625
95-th percentile: 6.16715
Maximum: 25.4
Range: 24.06
Interquartile range (IQR): 1.32425

Descriptive statistics

Standard deviation: 2.211515751
Coefficient of variation (CV): 0.49080017
Kurtosis: 38.14778138
Mean: 4.505939252
Median Absolute Deviation (MAD): 0.648
Skewness: 5.153862714
Sum: 2892.813
Variance: 4.890801917
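Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
4.125 | 3 | 0.4%
3.05 | 3 | 0.4%
4.348 | 3 | 0.4%
4.18 | 3 | 0.4%
3.963 | 3 | 0.4%
4.21 | 3 | 0.4%
3.398 | 2 | 0.3%
4.114 | 2 | 0.3%
4.069 | 2 | 0.3%
4.134 | 2 | 0.3%
Other values (565) | 616 | 91.7%
(Missing) | 30 | 4.5%

Minimum 5 values
Value | Count | Frequency (%)
1.34 | 1 | 0.1%
1.36 | 1 | 0.1%
1.37 | 1 | 0.1%
1.42 | 1 | 0.1%
1.47 | 1 | 0.1%

Maximum 5 values
Value | Count | Frequency (%)
25.4 | 1 | 0.1%
24.74 | 1 | 0.1%
23.69 | 1 | 0.1%
19.03 | 1 | 0.1%
16.61 | 1 | 0.1%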

GDP Growth
Real number (ℝ)

MISSING

Distinct count: 622
Unique (%): 95.8%
Missing: 23
Missing (%): 3.4%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.6062232191587056
Minimum: -8.227
Maximum: 10.062000000000001
Zeros: 0
Zeros (%): 0.0%
Memory size: 5.2 KiB

Quantile statistics

Minimum: -8.227
5-th percentile: -2.9712
Q1: 0.12
Median: 1.511
Q3: 3.518
95-th percentile: 5.3508
Maximum: 10.062
Range: 18.289
Interquartile range (IQR): 3.398

Descriptive statistics

Standard deviation: 2.614801556
Coefficient of variation (CV): 1.627919162
Kurtosis: 0.9582746093
Mean: 1.606223219
Median Absolute Deviation (MAD): 1.619
Skewness: -0.2629142953
Sum: 1042.438869
Variance: 6.837187179
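Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
1.44 | 4 | 0.6%
0.99 | 3 | 0.4%
2.486 | 2 | 0.3%
0.936 | 2 | 0.3%
-0.49 | 2 | 0.3%
3.077 | 2 | 0.3%
3.016 | 2 | 0.3%
0.209 | 2 | 0.3%
-0.15 | 2 | 0.3%
-0.33 | 2 | 0.3%
Other values (612) | 626 | 93.2%
(Missing) | 23 | 3.4%

Minimum 5 values
Value | Count | Frequency (%)
-8.227 | 1 | 0.1%
-6.995 | 1 | 0.1%
-6.888 | 1 | 0.1%
-6.73 | 1 | 0.1%
-5.941 | 1 | 0.1%

Maximum 5 values
Value | Count | Frequency (%)
10.062 | 1 | 0.1%
10.02 | 1 | 0.1%
9.772 | 1 | 0.1%
9.298 | 1 | 0.1%
8.921 | 1 | 0.1%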

VIX Index
Real number (ℝ≥0)

MISSING

Distinct count: 55
Unique (%): 8.3%
Missing: 12
Missing (%): 1.8%
Infinite: 0
Infinite (%): 0.0%
Mean: 22.4761990800303
Minimum: 11.40828125
Maximum: 62.54369231
Zeros: 0
Zeros (%): 0.0%
Memory size: 5.2 KiB

Quantile statistics

Minimum: 11.40828125
5-th percentile: 12.71242424
Q1: 16.09276923
Median: 21.15888889
Q3: 26.5959375
95-th percentile: 36.36454545
Maximum: 62.54369231
Range: 51.13541106
Interquartile range (IQR): 10.50316827

Descriptive statistics

Standard deviation: 8.793544995
Coefficient of variation (CV): 0.391238081
Kurtosis: 6.394061394
Mean: 22.47619908
Median Absolute Deviation (MAD): 5.10888889
Skewness: 2.016035645
Sum: 14834.29139
Variance: 77.32643359
Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
17.21015625 | 24 | 3.6%
16.05 | 12 | 1.8%
14.00307692 | 12 | 1.8%
26.5959375 | 12 | 1.8%
28.35 | 12 | 1.8%
21.9 | 12 | 1.8%
28.78430769 | 12 | 1.8%
26.8153125 | 12 | 1.8%
13.99318182 | 12 | 1.8%
24.70092308 | 12 | 1.8%
Other values (45) | 528 | 78.6%

Minimum 5 values
Value | Count | Frequency (%)
11.40828125 | 12 | 1.8%
12.42876923 | 12 | 1.8%
12.71242424 | 12 | 1.8%
13.13777778 | 12 | 1.8%
13.1653125 | 12 | 1.8%

Maximum 5 values
Value | Count | Frequency (%)
62.54369231 | 12 | 1.8%
46.07409091 | 12 | 1.8%
36.36454545 | 12 | 1.8%
34.33333333 | 12 | 1.8%
33.1 | 2 | 0.3%

Public Deficit
Real number (ℝ)

MISSING

Distinct count: 497
Unique (%): 76.7%
Missing: 24
Missing (%): 3.6%
Infinite: 0
Infinite (%): 0.0%
Mean: -2.1106481481481483
Minimum: -95.36
Maximum: 9.04
Zeros: 2
Zeros (%): 0.3%
Memory size: 5.2 KiB

Quantile statistics

Minimum: -95.36
5-th percentile: -9.9925
Q1: -4.3275
Median: -1.795
Q3: 0.705
95-th percentile: 5.08
Maximum: 9.04
Range: 104.4
Interquartile range (IQR): 5.0325

Descriptive statistics

Standard deviation: 5.814405062
Coefficient of variation (CV): -2.754795993
Kurtosis: 101.5944814
Mean: -2.110648148
Median Absolute Deviation (MAD): 2.52
Skewness: -6.584144346
Sum: -1367.7
Variance: 33.80730623
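Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0.13 | 4 | 0.6%
-4.66 | 4 | 0.6%
0.45 | 4 | 0.6%
-4.64 | 3 | 0.4%
-3.89 | 3 | 0.4%
-0.53 | 3 | 0.4%
-0.64 | 3 | 0.4%
0.29 | 3 | 0.4%
4.55 | 3 | 0.4%
-2.75 | 3 | 0.4%
Other values (487) | 615 | 91.5%
(Missing) | 24 | 3.6%

Minimum 5 values
Value | Count | Frequency (%)
-95.36 | 1 | 0.1%
-19.77 | 1 | 0.1%
-19.65 | 1 | 0.1%
-16.83 | 1 | 0.1%
-16.3 | 1 | 0.1%

Maximum 5 values
Value | Count | Frequency (%)
9.04 | 1 | 0.1%
8.69 | 1 | 0.1%
8.48 | 1 | 0.1%
7.93 | 1 | 0.1%
7.92 | 1 | 0.1%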

Unemployment Rate
Real number (ℝ≥0)

MISSING

Distinct count: 145
Unique (%): 22.0%
Missing: 14
Missing (%): 2.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 8.39352583586626
Minimum: 1.8
Maximum: 27.4
Zeros: 0
Zeros (%): 0.0%
Memory size: 5.2 KiB

Quantile statistics

Minimum: 1.8
5-th percentile: 3.3
Q1: 5.625
Median: 8
Q3: 9.8
95-th percentile: 15.49
Maximum: 27.4
Range: 25.6
Interquartile range (IQR): 4.175

Descriptive statistics

Standard deviation: 4.087027743
Coefficient of variation (CV): 0.4869262122
Kurtosis: 5.498488961
Mean: 8.393525836
Median Absolute Deviation (MAD): 2
Skewness: 1.86207248
Sum: 5522.94
Variance: 16.70379577
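Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
7.7 | 19 | 2.8%
8.7 | 18 | 2.7%
8.1 | 18 | 2.7%
4.6 | 16 | 2.4%
7.9 | 16 | 2.4%
8.5 | 15 | 2.2%
8.4 | 14 | 2.1%
8.3 | 14 | 2.1%
4.7 | 13 | 1.9%
7.2 | 13 | 1.9%
Other values (135) | 502 | 74.7%
(Missing) | 14 | 2.1%

Minimum 5 values
Value | Count | Frequency (%)
1.8 | 1 | 0.1%
1.9 | 2 | 0.3%
2 | 3 | 0.4%
2.1 | 1 | 0.1%
2.2 | 3 | 0.4%

Maximum 5 values
Value | Count | Frequency (%)
27.4 | 1 | 0.1%
26.6 | 2 | 0.3%
26.4 | 2 | 0.3%
26.1 | 2 | 0.3%
25.02 | 1 | 0.1%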

Credit To Private Sector
Real number (ℝ≥0)

MISSING

Distinct count: 611
Unique (%): 99.8%
Missing: 60
Missing (%): 8.9%
Infinite: 0
Infinite (%): 0.0%
Mean: 708.3429166666666
Minimum: 16.17
Maximum: 2725.6
Zeros: 0
Zeros (%): 0.0%
Memory size: 5.2 KiB

Quantile statistics

Minimum: 16.17
5-th percentile: 35.7865
Q1: 172.2405
Median: 262.993
Q3: 1157.34275
95-th percentile: 2504.735
Maximum: 2725.6
Range: 2709.43
Interquartile range (IQR): 985.10225

Descriptive statistics

Standard deviation: 787.2255127
Coefficient of variation (CV): 1.111362158
Kurtosis: 0.3458136313
Mean: 708.3429167
Median Absolute Deviation (MAD): 198.739
Skewness: 1.267162906
Sum: 433505.865
Variance: 619724.0079
Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
2491.9 | 2 | 0.3%
108.354 | 1 | 0.1%
57.503 | 1 | 0.1%
162.728 | 1 | 0.1%
197.62 | 1 | 0.1%
252.494 | 1 | 0.1%
636.689 | 1 | 0.1%
182.501 | 1 | 0.1%
269.76 | 1 | 0.1%
1177.475 | 1 | 0.1%
Other values (601) | 601 | 89.4%
(Missing) | 60 | 8.9%

Minimum 5 values
Value | Count | Frequency (%)
16.17 | 1 | 0.1%
16.74 | 1 | 0.1%
16.86 | 1 | 0.1%
16.98 | 1 | 0.1%
17.16 | 1 | 0.1%

Maximum 5 values
Value | Count | Frequency (%)
2725.6 | 1 | 0.1%
2722.7 | 1 | 0.1%
2720.6 | 1 | 0.1%
2716.4 | 1 | 0.1%
2713.5 | 1 | 0.1%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for a linear relationship the steepness of the line does not affect r; only the sign of the slope does.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
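
The calculation described above can be sketched in a few lines of plain Python. This is a minimal illustration, not the code the report itself ran; the function name `pearson_r` and the sample lists are made up for the example. Because the same normalising factor appears in the covariance and in both standard deviations, it cancels, and the sketch works directly with sums of deviations.

```python
import math


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/(n-1) factors of the covariance and of the two standard
    # deviations cancel, so plain sums of deviations are sufficient.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


x = [1.0, 2.0, 3.0, 4.0, 5.0]
y_up = [2 * v + 10 for v in x]   # shifted and rescaled copy of x
y_down = [-0.5 * v for v in x]   # same line shape, negative slope

r_up = pearson_r(x, y_up)
r_down = pearson_r(x, y_down)
print(r_up, r_down)
```

The two results illustrate the invariance property: shifting and rescaling leaves r at +1, and flipping the sign of the slope gives -1. A constant input has zero variance, so this sketch would raise a division error for it.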

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
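
A minimal sketch of this rank-then-correlate recipe follows, again in plain Python. The helper names (`average_ranks`, `spearman_rho`) are illustrative, and giving tied values the average of their positions is an assumption of this sketch; it is a common convention but the report text does not say how ties were handled.

```python
import math


def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions i..j are 0-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    """Covariance divided by the product of standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def spearman_rho(xs, ys):
    """Pearson's r applied to the rank variables of xs and ys."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]  # monotonic but clearly nonlinear
rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, r)
```

For this cubic relationship ρ is 1 while Pearson's r is below 1, which is the sense in which ρ captures nonlinear monotonic association.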

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
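
The pair-counting definition can be written out directly. The sketch below is an illustration with a hypothetical function name, `kendall_tau_a`; it implements the formula exactly as worded here, which corresponds to the variant usually called τ-a. Implementations that adjust for ties (often called τ-b) can give different values when ties are present, and this section does not state which variant the report used.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs, with no tie correction."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(xs)), 2))
    for i, j in pairs:
        # A pair is concordant when both variables move in the same direction.
        product = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
        # product == 0 means a tie in xs or ys: counted in neither group
    return (concordant - discordant) / len(pairs)


x = [1, 2, 3, 4]
y = [1, 3, 2, 4]  # one swapped pair relative to x
tau = kendall_tau_a(x, y)
print(tau)
```

With four observations there are six pairs; five are concordant and one is discordant, so τ = (5 - 1) / 6.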

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available for the phik package.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
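
For illustration, the sketch below computes a bias-corrected Cramér's V from a contingency table in plain Python, using the correction commonly attributed to Bergsma (2013). The function name and the example tables are assumptions of this sketch, not the report's own code. The first example assumes a balanced panel in which each of the 12 countries appears 14 times in each of the 4 quarters (consistent with 56 rows per country and 168 rows per quarter, but not verified against the data).

```python
import math


def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as rows of counts."""
    n = sum(sum(row) for row in table)
    r, k = len(table), len(table[0])
    row_totals = [sum(row) for row in table]
    col_totals = [sum(row[j] for row in table) for j in range(k)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected

    phi2 = chi2 / n
    # Bias correction: shrink phi^2 and the effective table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))


# Assumed balanced panel: 12 countries x 4 quarters, 14 years in every cell.
country_by_quarter = [[14] * 4 for _ in range(12)]
v_independent = cramers_v_corrected(country_by_quarter)

# A perfectly associated 2 x 2 table for contrast.
v_perfect = cramers_v_corrected([[50, 0], [0, 50]])
print(v_independent, v_perfect)
```

Under the balanced-panel assumption the counts match the independence expectation exactly, so the coefficient is 0; the perfectly associated table gives 1. The sketch does not guard against empty rows or columns, which would make an expected count zero.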

Missing values

Sample

First rows

index | year | quarter | country | Current Account Balance | Public debt | sovereign yields | GDP Growth | VIX Index | Public Deficit | Unemployment Rate | Credit To Private Sector
0 | 2000 | 1 | Austria | -0.842349 | 0.704 | 5.685 | 3.521 | 23.981562 | -2.16 | 8.2 | 174.30
1 | 2000 | 2 | Austria | -0.668119 | 0.708 | 5.537 | 3.675 | 25.908906 | -2.11 | 8.0 | 177.85
2 | 2000 | 3 | Austria | -0.702135 | 0.713 | 5.578 | 3.739 | 19.634062 | -1.83 | 7.9 | 185.06
3 | 2000 | 4 | Austria | -0.669169 | 0.662 | 5.389 | 3.651 | 26.815313 | -1.31 | 7.8 | 187.33
4 | 2001 | 1 | Austria | -1.939536 | 0.694 | 5.087 | 2.890 | 26.595938 | -0.60 | 7.7 | 186.46
5 | 2001 | 2 | Austria | -1.569105 | 0.700 | 5.330 | 2.035 | 24.700923 | 0.00 | 7.7 | 190.73
6 | 2001 | 3 | Austria | 0.217592 | 0.697 | 5.133 | 1.204 | 26.611846 | 0.11 | 7.9 | 192.45
7 | 2001 | 4 | Austria | 0.218635 | 0.668 | 4.815 | 0.520 | 28.784308 | -0.27 | 8.1 | 194.40
8 | 2002 | 1 | Austria | 2.337973 | 0.710 | 5.172 | 0.646 | 22.150484 | -0.64 | 8.3 | 194.15
9 | 2002 | 2 | Austria | 2.108809 | 0.700 | 5.333 | 0.941 | 22.336154 | -0.81 | 8.5 | 195.35

Last rows

index | year | quarter | country | Current Account Balance | Public debt | sovereign yields | GDP Growth | VIX Index | Public Deficit | Unemployment Rate | Credit To Private Sector
662 | 2011 | 3 | Finland | -0.49853 | 0.4720 | 2.73 | 3.110000 | 32.237727 | -0.53 | 6.9 | 173.091
663 | 2011 | 4 | Finland | -5.12018 | 0.5798 | 2.52 | 0.990000 | 28.350000 | -0.92 | 7.0 | 175.401
664 | 2012 | 1 | Finland | -1.69615 | 0.5761 | 2.31 | 1.440000 | 20.000000 | -0.96 | 8.1 | 177.510
665 | 2012 | 2 | Finland | -0.88669 | 0.5737 | 1.91 | -0.150000 | 21.900000 | -0.72 | 8.8 | 180.925
666 | 2012 | 3 | Finland | -0.98288 | 0.5727 | 1.64 | -1.080000 | 16.050000 | -0.67 | 7.2 | 182.501
667 | 2012 | 4 | Finland | -4.22256 | 0.5300 | 1.68 | -0.853199 | 18.300000 | -2.34 | 7.9 | 182.824
668 | 2013 | 1 | Finland | -2.06522 | 0.5385 | 1.72 | -0.169577 | 17.700000 | -3.11 | 8.1 | NaN
669 | 2013 | 2 | Finland | -1.72673 | 0.5423 | 1.66 | 0.065918 | 15.750000 | -2.00 | 8.1 | NaN
670 | 2013 | 3 | Finland | -1.43470 | 0.5453 | NaN | -0.038005 | 18.450000 | -2.05 | 8.1 | NaN
671 | 2013 | 4 | Finland | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN